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DETAILED ACTION 

1 . This Office Action is in response to the RCE filed on 05/14/2008. Claims 1-3, 5, 
7-16, 19. and 20 remain pending and have been examined. The Applicants' remarks 
have been carefully considered, but they are not persuasive and do not place the claims 
in condition for allowance. 

2. All previous objections and rejections directed to the Applicant's disclosure and 
claims not discussed in this Office Action have been withdrawn by the Examiner. 
Further, it should be noted that since applicant did not traverse the examiner's assertion 
of official notice, the common knowledge or well-known in the art statement is taken to 
be admitted prior art because applicant either failed to traverse the examiner's assertion 
of official notice or that the traverse was inadequate (see MPEP 2144.03, C). 

Response to Arguments 

3. Applicant's arguments, see page 6, filed on 05/14/2008 with respect to the 
rejection(s) of claim(s) 1, 9, and 14 under Tackin (US 7,180,892) in view of Smith etal. 
(US 6,862,298) have been fully considered and are moot in view of new grounds for 
rejection. Accordingly, new references have been cited, Gentle in view of Dowdal. 

Claim Rejections - 35 USC § 101 

4. 35 U.S.C. 101 reads as follows: 



Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefore, subject to the 
conditions and requirements of this title. 
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Claims 14-20 are rejected under 35 U.S.C 101 because the claimed invention is 
directed to non-statutory subject matter. 

The statutory class of the claimed limitations in claim 14-20 do not fall under the 
statutory class of manufacture. The statutory class of "manufacture" deals with 
production of raw materials for use from raw materials. Hence, the claim as it stands 
falls under an incorrect statutory class. 

Claims 14-20 are drawn to a "article" and a "storage medium" perse as recited in 
the preamble and as such is non-statutory subject matter. Further, from the Applicant's 
Specification, it was described that the medium can be a communications medium or a 
computer readable storage medium as described in Applicants Specification, pages 4-5, 
last paragraph on page 4, continued into page 5, and page 10, lines 8-1 1 . From the cited 
portions, it is unclear whether the article is related to the communications media or 
computer readable media. See MPEP 2106.01 [R-5]. Data structures not claimed as 
embodied in computer readable media are descriptive material perse and are not 
statutory because they are not capable of causing functional change in the computer. 
See e.g., Warmerdam, 33 F.3d at 1361, 31, USPQ2d at 1760 (claim to a data structure 
perse held nonstatutory). Such claimed data structures do not define any structural and 
functional interrelationships between data and other claimed aspects of the invention, 
which permit the data structure's functionality to be realized. In contrast, a claimed 
computer readable medium encoded with a data structure defines structural and 
functional interrelationships between the data structure and the computer software and 
hardware components which permit the data structure's functionality to be realized, and 
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is thus statutory. It is advised that the Applicant change the preamble of the claim to 
recite a "computer readable storage medium including stored instructions that, when 
executed by a processor...." 

Claim Rejections - 35 USC §112 

5. The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

6. Claims 14-20 are rejected under 35 U.S.C. 112, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. From the Applicant's Specification, it was described 
that the medium can be a communications medium or a computer readable storage 
medium as described in Applicants Specification, pages 4-5, last paragraph on page 4, 
continued into page 5, and page 10, lines 8-11. From the cited portions, it is unclear 
whether the article is related to the communications media or computer readable media. 
For the purposes of compact prosecution the limitation was interpreted to mean a 
communications media. 

Claim Rejections - 35 USC § 103 

5. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 
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6. Claims 1, 5, 7, 9, 13, 14, and 19 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Gentle et al. (US 2004/0073692) in view of Dowdal (US 7,346,005). 
As to claims 1 and 14, Gentle teaches a method, comprising: 

receiving a plurality of packets (see [0036], VAD monitors packet 
structures in incoming digital voice stream) with audio information (see Abstract, 
audio stream, also see [0036], voice) (e.g. Applicant defines audio information to 
include voice and silence (see page 4, [0006], lines 3-5). Audio packets are 
retrieved.); 

determining by a voice activity detector (see [0036], VAD 220) whether 
said audio information represents voice information (see [0036], VAD determines 
if voice activity is present) (e.g. The determination of the audio information is 
found by the voice activity detector 220.); and 

buffering said audio information in a jitter buffer (see Figure 3, buffer 
manager 330) after said determination (see [Figure 2, VAD 220). The reference 
also teaches the use of a computer entailing a computer readable medium for the 
above limitations (see [0030])) (e.g. Audio information is buffered.). 

wherein said determining comprises: 

receiving frames of audio information at a voice activity detector (see 
[0036], packet structures are received); 

measuring at least one characteristic (see [0051], [0036], and [0004], 
where the Reference discloses convention technique and shows an alternative 
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based on silence threshold to determine voice activity and energy level 
measurements) of said frames (see [0036], packet structure ) 

determining a start of voice information based on said measurements (see 
[0052], VAD 220 determines silence or nonsilence as well as beginning and 
endpoints); and 

determining an end to said voice information based on said (see [0052], 
VAD 220 determines silence or nonsilence as well as beginning and endpoints) 
and a delay interval (see [0051], timing measurement module used to determine 
jitter by VAD 220); and 

adjusting of the delay interval (see [0043], timing measurement module 
allows adaptive control of FIFO delay) 

However, Gentle does not specifically teach the adjusting of the delay 
interval based on an average packet delay time. 

Dowdal teaches the adjusting said delay interval to correspond to an 
average packet delay time (see col. 4, lines 33-60, delay between packets are 
calculated and a calculated running average is maintained in order to reset the 
value of the FIFO buffer for playout). 

It would have been obvious to one of ordinary skilled in the art at the time 
the invention was made to have modified the voice based packet network as 
taught by Gentle with the use of a delay based on the average packet delay time 
as taught by Dowdal. The motivation to have combined the two references 
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involves the improvement in audio quality for effective playout of audio by 
minimizing jitter and delay (see Dowdal, col. 1, lines 15-21). 

As to claim 5, Gentle in view of Dowdal teaches all of the limitations as in claim 
1 , above. 

Furthermore, Gentle teaches said characteristic comprises an estimate of 
an energy level for said frame (see [0051], energy level measurement can be 
employed by VAD 220) (e.g. An energy level is used to determine if speech is 
present.). 

As to claims 7 and 19, Gentle in view of Dowdal teaches all of the limitations as 
in claims 1 and 14, above. 

Furthermore, Dowdal teaches measuring an average packet delay time by 
said jitter buffer (see Dowdal, (see col. 4, lines 33-60, delay between packets 
are calculated and a calculated running average is maintained in order to reset 
the value of the FIFO buffer for playout). 

Furthermore, Gentle and Dowdal teaches sending said average packet 
delay time (see Dowdal, col. 4, lines 33-60) to said voice activity detector (see 
Figure 2, acoustic prioritization agent 232, and VAD 220 and [0041]) (e.g. The 
prioritization agent communicated with VAD the packet beginning and endpoints 
for synchronization) 
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As to claim 9, Gentle teaches a system comprising: 

an antenna (see [0030], radio, telephone, wired analog, etc. )(e.g. It is 
inherent that digital phones consist of built-in antenna as well as a receiver for 
hearing audio information and transmitter for transmitting information. ); 

a receiver connected to said antenna (see [0030], radio, telephone, wired 
analog, etc. and see Figure 3, 228, receives information from first user and 
[0040]) to receive a frame of information (e.g. The receiver receives the packets 
of information from first user) 

a voice activity detector (see [0036], VAD determines if voice activity is 
present) to detect voice information in said frame see [0036], VAD monitors 
packet structures in incoming digital voice stream) (e.g. The determination of the 
audio information is found by the voice activity detector 220.); and 

a jitter buffer (see Figure 3, buffer manager 330) to buffer said information 
after said detection by said voice activity detector buffer (see Figure 2, VAD 
220)). 

wherein said voice activity detector receives frames of audio information, 
measures at least one characteristic of said frames (see [0051], [0036], and 
[0004], where the Reference discloses convention technique and shows an 
alternative based on silence threshold to determine voice activity and energy 
level measurements and (see [0036], packet structure ), determines a start of 
voice information based on said measurements (see [0052], VAD 220 
determines silence or nonsilence as well as beginning and endpoints), 
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determines an end to said voice information based on said (see [0052], VAD 220 
determines silence or nonsilence as well as beginning and endpoints) and a 
delay interval (see [0051], timing measurement module used to determine jitter 
by VAD 220), adjusting of the delay interval (see [0043], timing measurement 
module allows adaptive control of FIFO delay) 

However, Gentle does not specifically teach the adjusting of the delay 
interval based on an average packet delay time. 

Dowdal et al. teaches the adjusting said delay interval to correspond to an 
average packet delay time (see col. 4, lines 33-60, delay between packets are 
calculated and a calculated running average is maintained in order to reset the 
value of the FIFO buffer for playout). 

It would have been obvious to one of ordinary skilled in the art at the time 
the invention was made to have modified the voice based packet network as 
taught by Gentle with the use of a delay based on the average packet delay time 
as taught by Dowdal. The motivation to have combined the two references 
involves the improvement in audio quality for effective playout of audio by 
minimizing jitter and delay (see Dowdal, col. 1 , lines 1 5-21 ). 

As to claim 13, Gentle in view of Dowdal teaches all of the limitations as in claim 
9, above. 

Furthermore, Gentle teaches said voice activity detector further comprises 
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an estimator to estimate energy level values (see [[0051], energy level 
measurement by VAD 220) (e.g. Energy levels are estimated.); 

a voice classification module connected to said estimator to classify 
information for said frame (see [0051], VAD 220 classifies based on silence or 
non-silence) 

7. Claims 2, 3, 1 2, 1 5, and 1 6 are rejected under 35 U.S.C. 1 03(a) as being 
unpatentable over Gentle in view of Dowdal, as applied to claims 1 , 9, and 1 4 above, 
in view of Clemm (US 6,865,1 62). 

As to claims 2 and 15, Gentle in view of Dowdal teach a voice based packet 
network. 

However, Gentle in view of Dowdal. does not specifically teach the 
buffering of a portion of said audio information in a pre-buffer for a predetermined 
time interval. 

Clemm does teach the use of a buffer (see col. 2, line 31 ) for a 
predetermined time (see col. 2, lines 31-33) prior to said determining (see Figure 
1 , elements 1 1 0 and 1 20 and col. 2, lines 30-37) (e.g. A pre-buffer is used.). 

It would have been obvious to one of ordinary skilled in the art at the time 
the invention was made to have modified the voice based packet network as 
taught by Gentle in view of Dowdal with the buffer before the voice activity 
detector as taught by Clemm. The motivation to have combined the two 
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references involve the elimination of clipping associated with voice activity 
detector directed during silence suppression (see Clemm col. 2, lines 47-48). 

As to claims 3 and 16, Gentle in view of Dowdal teaches all of the limitations as 
in claims 1 and 13, above. 

Furthermore, gentle teaches sending said information from the jitter buffer 
to an end user (see Figure 3, second user, 312) (e.g. The applicant denotes the 
endpoint to be defined as the human user (see Applicant's Specification, page 8, 
[0018], lines 5-6). (Further, the sending of audio information to the user from the 
pre-buffer would have been apparent with the teaching presented by Clemm to 
avoid clipping). 

As to claim 1 2, Gentle in view of Dowdal teach all of the limitations as in claim 9. 

Furthermore, Gentle in view of Dowdal etal. teach a voice packet based 
network. 

However, Gentle in view of Dowdal do not specifically teach the buffering 
of a portion of said audio information in a pre-buffer for a predetermined time 
interval. 

Clemm teaches further comprising a buffer to store pre-threshold speech 
during detection by voice activity detector (see Figure 1 , elements 1 10 and 120 
and col. 2, lines 30-37) (The reference buffers a pre-threshold speech based 
upon two values, from a delay.) 
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It would have been obvious to one of ordinary skilled in the art at the time 
the invention was made to have modified the voice based packet network as 
taught by Gentle in view of Dowdal with the buffer before the voice activity 
detector as taught by Clemm. The motivation to have combined the two 
references involve the elimination of clipping associated with voice activity 
detector directed during silence suppression (see Clemm ,col. 2, lines 47-48). 

8. Claims 8, 1 0, 1 1 , and 20 are rejected under 35 U.S.C. 1 03(a) as being 
unpatentable over Gentle in view of Dowdal as applied to claim 9 above, and further in 
view of Sih et al. (US 5,920,834). 

As to claims 8 and 20, Gentle in view of Dowdal teaches all of the limitations as 
in claim 1 and 14, above. 

Furthermore, Gentle teaches retrieving a frame (see Figure 2, output of 

212) of audio information from said packets (e.g. Audio information in the form of 

voice is received, which has undergone pulse code modulation); 

canceling echo from said frame of audio information (see echo canceller 

216); and 

sending said frame of audio information to a voice activity detector (see 
Figure 6, output of echo canceller 21 6 to input of VAD 220). 

However, Gentle in view of Dowdal do not specifically teach the receiving 
of an echo cancellation reference signal. 
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Sih does teach receiving an echo cancellation reference signal (col. 6, 
lines 14-18) and Figure 2, echo canceller 10, z'(n) is the reference signal.); 

It would have been obvious to one of ordinary skilled in the art at the time 
the invention was made to have modified the voice based packet network as 
taught by Gentle in view of Dowdal with the use of a reference signal to cancel 
echo as taught by Sih for the purpose of noise suppression (see Sih, col. 3, lines 
5). 

As to claim 1 0, Gentle in view of Dowdal teach all of the limitations as in claim 9. 

Furthermore, Gentle in view of Dowdal teach a voice packet based 
network. 

However, Gentle in view of Dowdal do not specifically teach the echo 
canceller connected to a receiver to cancel the echo. 

However, Sih et al. does teach the echo canceller being connected to a 
receiver (see Figure 1 , elements 14 and 10) (e.g. It is evident that a transceiver 
consists of a receiver and a transmitter). 

It would have been obvious to one of ordinary skilled in the art at the time 
the invention was made to have the echo canceller connected to a receiver. The 
motivation to have combined the two references involves cancellation of echo for 
mobile phones that may occur in speech signals (e.g. see Sih et al., col. 23-25) 
as would have been apparent in the teachings of Gentle, which describes 
communication between telephony devices. 
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As to claim 1 1 , Gentle in view of Dowdal in view of Sih et al. teaches all of the 
limitations as in claim 9. 

Furthermore, Sih et al. teaches a transmitter (see Figure 1, element 14) 
(e.g. Transceiver consists of a transmitter) to provide an echo cancellation signal 
to said echo canceller (see Figure 1, element 10 and col. 6, lines 14-18). 

Conclusion 

9. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

Shaffer et al. (US 6,707,821 ) is cited to disclose jitter minimization in a packet 
based system. Bhattacharya etal. (US 7,376,148) and Eckberg (US 2003/0202528) is 
cited to disclose minimization of jitter in a packet network. LeBlanc (US 2004/0057445) 
is cited to disclose jitter buffer in a packet voice system. El-Hennawey (US 
2004/0071084) is cited to disclose quality monitoring of voice based network. 

1 0. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to PARAS SHAH whose telephone number is (571)270- 
1650. The examiner can normally be reached on MON.-THURS. 7:00a. m.-4:00p.m. 
EST. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Patrick Edouard can be reached on (571)272-7603. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



/Paras Shah/ 
Examiner, Art Unit 2626 

06/25/2008 

/Patrick N. Edouard/ 

Supervisory Patent Examiner, Art Unit 2626 



